Design and optimization of DBSCAN Algorithm based on CUDA

نویسندگان

  • Bingchen Wang
  • Chenglong Zhang
  • Lei Song
  • Lianhe Zhao
  • Yu Dou
  • Zihao Yu
چکیده

DBSCAN is a very classic algorithm for data clustering, which is widely used in many fields. However, with the data scale growing much more bigger than before, the traditional serial algorithm can not meet the performance requirement. Recently, parallel computing based on CUDA has developed very fast and has great advantage on big data. This paper summarizes the algorithms proposed before and improves the performance of the old DBSCAN algorithm by CUDA. The algorithm uses shared memory as much as possible compared with other algorithms and it has very good scalability. A data set is tested on the algorithm of new version. Finally, we analyze the results and give a conclusion that our algorithm is approximately 97 times faster than the serial version.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An approach to Improve Particle Swarm Optimization Algorithm Using CUDA

The time consumption in solving computationally heavy problems has always been a concern for computer programmers. Due to simplicity of its implementation, the PSO (Particle Swarm Optimization) is a suitable meta-heuristic algorithm for solving computationally heavy problems. However, despite the simplicity, the algorithm is inefficient for solving real computationally heavy problems but the pr...

متن کامل

بررسی مشکلات الگوریتم خوشه بندی DBSCAN و مروری بر بهبودهای ارائه‌شده برای آن

Clustering is an important knowledge discovery technique in the database. Density-based clustering algorithms are one of the main methods for clustering in data mining. These algorithms have some special features including being independent from the shape of the clusters, highly understandable and ease of use. DBSCAN is a base algorithm for density-based clustering algorithms. DBSCAN is able to...

متن کامل

Parallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach

There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...

متن کامل

Improvement of density-based clustering algorithm using modifying the density definitions and input parameter

Clustering is one of the main tasks in data mining, which means grouping similar samples. In general, there is a wide variety of clustering algorithms. One of these categories is density-based clustering. Various algorithms have been proposed for this method; one of the most widely used algorithms called DBSCAN. DBSCAN can identify clusters of different shapes in the dataset and automatically i...

متن کامل

Design Optimization of Axial Flux Surface Mounted Permanent Magnet Brushless DC Motor For Electrical Vehicle Based on Genetic Algorithm

This paper presents the design optimization of axial flux surface mounted Permanent Magnet Brushless DC motor based on genetic algorithm for an electrical vehicle application. The rating of the motor calculated form vehicle dynamics is 250 W, 150 rpm. The axial flux surface mounted Permanent Magnet Brushless DC (PMBLDC) motor was designed to fit in the rim of the wheel. There are several design...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1506.02226  شماره 

صفحات  -

تاریخ انتشار 2015